OntoDW: An Approach for Extraction of Conceptualizations from Data Warehouses

Authors

  • Tiago Outerelo da Silva
  • Fernanda Araujo Baião
  • Kate Revoredo
Abstract

Business Intelligence (BI) supports sound decision-making in organizations, mainly by providing the means to analyze historical data stored in repositories called Data Warehouses (DW). However, a formal representation of the concepts implemented in a DW rarely exists, even though such a representation would clarify and semantically describe both the domain concepts behind the stored data and the analytical concepts available to BI tools. Important pieces of knowledge that are frequently hidden in the DW include: which domain concepts are available as analysis perspectives (dimensions), how the domain concepts relate to each other, which metrics (facts) are available and what they mean, which domain perspectives are considered for each metric, and how metrics may be aggregated. In Computer Science, one relevant use of an ontology is as a codified artifact that formally represents a shared conceptualization of a universe of discourse. Ontologies can therefore be used to represent both the domain and the analytical concepts codified and stored in a DW. However, extracting these concepts from an already-in-production DW is not a trivial task, especially in medium and large organizations, which often have tens of metrics and tens (even hundreds) of dimensions and potential aggregations. In this paper, we define a set of mapping rules from DW constructs to conceptual elements (concepts and relationships), with the goal of automatically extracting an ontology codified in OWL. The proposal was successfully evaluated in a real scenario at a Brazilian financial institution.
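To make the idea of mapping DW constructs to conceptual elements concrete, the following is a minimal sketch and not the authors' actual rule set, which the abstract does not spell out. It assumes a simple star schema and three hypothetical rules: each dimension and each fact becomes an OWL class, each attribute or measure becomes a datatype property, and each fact-to-dimension link becomes an object property. The names `Dimension`, `Fact`, `to_turtle`, the `has<Dimension>` property naming, and the `http://example.org/dw#` namespace are all illustrative assumptions.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Dimension:
    """Hypothetical description of a DW dimension table."""
    name: str
    attributes: list[str] = field(default_factory=list)


@dataclass
class Fact:
    """Hypothetical description of a DW fact table."""
    name: str
    measures: list[str]
    dimensions: list[str]  # names of the dimensions the fact references


def to_turtle(dimensions: list[Dimension], facts: list[Fact],
              base: str = "http://example.org/dw#") -> str:
    """Emit OWL axioms in Turtle syntax using three assumed mapping rules."""
    lines = [
        f"@prefix : <{base}> .",
        "@prefix owl: <http://www.w3.org/2002/07/owl#> .",
        "@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .",
        "",
    ]
    for dim in dimensions:
        # Assumed rule 1: a dimension becomes a class, its attributes datatype properties.
        lines.append(f":{dim.name} a owl:Class .")
        for attr in dim.attributes:
            lines.append(f":{attr} a owl:DatatypeProperty ; rdfs:domain :{dim.name} .")
    for fact in facts:
        # Assumed rule 2: a fact becomes a class, its measures datatype properties.
        lines.append(f":{fact.name} a owl:Class .")
        for measure in fact.measures:
            lines.append(f":{measure} a owl:DatatypeProperty ; rdfs:domain :{fact.name} .")
        # Assumed rule 3: each referenced dimension becomes an object property.
        for dim_name in fact.dimensions:
            lines.append(
                f":has{dim_name} a owl:ObjectProperty ; "
                f"rdfs:domain :{fact.name} ; rdfs:range :{dim_name} ."
            )
    return "\n".join(lines)


if __name__ == "__main__":
    # Invented example schema, loosely themed on a financial institution.
    dims = [Dimension("Customer", ["customerSegment"]), Dimension("Branch", ["branchCity"])]
    facts = [Fact("Loan", ["loanAmount"], ["Customer", "Branch"])]
    print(to_turtle(dims, facts))
```

A real extraction would also need to cover hierarchies within dimensions and aggregation semantics for each metric, which this sketch deliberately leaves out.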

Similar resources

A Three-Echelon Multi-Objective Multi-Period Multi-Product Supply Chain Network Design Problem: A Goal Programming Approach

In this paper, a multi-objective multi-period multi-product supply chain network design problem is introduced. This problem is modeled using multi-objective mixed-integer mathematical programming. The objectives are maximizing the total profit of logistics, maximizing service level, and minimizing inconsistency of operations. Several sets of constraints are considered to handle the real situa...

A Reliability Approach on Redesigning the Warehouses in Supply Chain with Uncertain Parameters via Integrated Monte Carlo Simulation and Tuned Artificial Neural Network

In this paper, a reliability approach on reconfiguration decisions in a supply chain network is studied based on coupling the simulation concepts and artificial neural network. In other words, due to the limited budget for warehouse relocation in a supply chain, the failure probability is assessed for determining the robust decision for future supply chain configuration. Traditional solving ...

Cultural Conceptualizations in Persian Language: Implications for L2 Learning

Intercultural communication is concerned with communication across cultures. Since cultures as well as languages differ from one another in significant ways, speakers conceptualize the world around them in different ways. These cultural conceptualizations form part of the collective cognition of a speech community or cultural group. This paper is an attempt to delineate some cultural schemas in...

Data Extraction using Content-Based Handles

In this paper, we present an approach and a visual tool, called HWrap (Handle Based Wrapper), for creating web wrappers to extract data records from web pages. In our approach, we mainly rely on the visible page content to identify data regions on a web page. Our extraction algorithm is inspired by the way a human user scans the page content for specific data. In particular, we use text fea...

The Viability of Oil Extraction from Trinidad Tar Sands by Radio Frequency Heating: A Simulation Approach

Trinidad has tar sand resources of about 2 billion barrels of oil on land in the Parrylands/Guapo and Brighton areas. With an oil price of over USD 25 per barrel, the commercial extraction of oil from Trinidad tar sands is viable, but it requires a careful study. The relatively small extent of this tar sand (about 10,000 acres and with depths varying from surface to less than 500 feet) and with...

Publication date: 2016